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Abstract 

In order to study the phenomenon in detail that income distribution foUows Pareto law, 
we analyze the database of high income companies in Japan. We find a quantitative relation 
between the average capital of the companies and the Pareto index. The larger the average 
capital becomes, the smaller the Pareto index becomes. From this relation, we can possibly 
explain that the Pareto index of company income distribution hardly changes, while the 
Pareto index of personal income distribution changes sharply, from a viewpoint of capital 
(or means) . We also find a quantitative relation between the lower bound of capital and the 
typical scale at which Pareto law breaks. The larger the lower bound of capital becomes, 
the larger the typical scale becomes. From this result, the reason there is a (no) typical 
scale at which Pareto law breaks in the income distribution can be understood through (no) 
constraint, such as the lower bound of capital or means of companies, in the financial system. 
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1 Introduction 

It is well known that personal high income distribution follows power law [1]: 

Ap(>x)ocx~" (1) 

with high accuracy [2]. Here x is the income and Np[> x) is the number of persons with the 
income larger than or equals to x, and the power law and the exponent a is called Pareto law 
and Pareto index, respectively. 

For instance in Japan, persons who paid annual income tax more than 10 million yen are 
announced publicly as "high income taxpayers" every year. It is reported that their income 
distribution follows Pareto law [2]. In Ref. [3], it is also reported that Pareto index annually 
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changes around 2.0, and especially changed sharply (1.8 =^ 2.1) when the economic bubble 
collapsed in Japan. 

In addition, it is well known that, not only the personal high income distribution, but also the 
company high income distribution follows Pareto law. For instance in Japan, companies having 
annual income more than 40 million yen are announced publicly as "high income companies" 
every year. It is reported that their income distribution follows Pareto law and the Pareto index 
hardly changes around 1.0 annually [4, 5, 6]. Power law with exponent —1 is especially called 
Pareto-Zipf law. Furthermore in Ref. [4, 5], it is reported that income distributions in most job 
categories follow Pareto law, however, those Pareto indices scatter around 1.0. 

On the other hand, it is well known that personal or company distribution with low-middle 
income region docs not follow Pareto law [7, 8, 2, 9]. 

The research on income distribution is well investigated in cconophysics [10], and many mod- 
els to explain the income distribution are proposed (For instance [3, 5, 6, 9]). There are, however, 
several unsolved problems. The first biggest question is why Pareto law appears (breaks). We 
want to clarify the reason that the high income distribution follows Pareto law, while the low- 
middle income distribution does not follow Pareto law. The second question is why Pareto index 
changes (does not change) . For instance in recent annual income distribution in Japan, we want 
to understand the reason that Pareto index of personal high income distribution changes sharply, 
while Pareto index of company high income distribution hardly changes. The third question is 
whether there is any economical variable related to Pareto index. If we find an answer to the 
third question, we may be able to solve the second and the first questions. 

In order to clear these problems, we should examine Pareto law in detail. In this paper, we 
analyze the database of high income companies in recent Japan. The reason is as follows. The 
information in the database of high income taxpayers is limited, only income tax, individual 
name and the address. The database of high income companies, however, includes rich financial 
information, not only income, company name and the address, but also capital, number of em- 
ployees, sales and profits. We analyze the database published by TOKYO SHOKO RESEARCH, 
LTD. [11]. 

By analyzing this database, we obtain the following several relations between Pareto law 
and capital. Firstly, by classifying companies into job categories, we find a quantitative relation 
between the average capital of companies in each job category and the Pareto index. The larger 
the average capital becomes, the smaller the Pareto index becomes. 

Secondly, we classify companies into capital categories. We observe that even high income 
distribution follows power law only in the large-scale region and deviates further from the power 
law in the smaller-scale region. This distribution has the same feature to the distribution with 
low-middle income region. In this capital classification, we also observe that the larger the 
capital class becomes, the smaller the Pareto index in the power law region becomes. This 
means that the relation between the average capital and the Pareto index, found by classifying 
companies into job categories, holds regardless of the job classification. In addition, wc observe 
that the larger the capital class becomes, the larger the typical scale at which Pareto law breaks 
becomes. In this paper, we call the scale at which Pareto law breaks the typical scale [9]. 
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Thirdly, we classify companies into cumulative capital categories. We observe that, in this 
classification, each Pareto index in power law region can be estimated to be about 1.0. When 
Pareto index is fixed in each class, the typical scale can be read systematically. As a result, 
we find a quantitative relation between the lower bound of cumulative capital category and 
the typical scale. The larger the lower bound of capital becomes, the larger the typical scale 
becomes. 

Prom these analytic results, we obtain the following conclusions. Firstly, the average capital 
is related to the Pareto index. Prom this relation, the small change of Pareto index can be 
interpreted as small fluctuations in the average capital. Secondly, the typical scale is related to 
the lower bound of capital. As a result, the reason Pareto law appears in the income distribution 
can be understood through no constraint, such as the lower bound of capital (or means), in the 
financial system [9]. 

2 Income distribution in job categories 

In this Section, we classify Japanese high income companies into job categories, and explain 
that the average capital of the companies in each job category is related to the Pareto index. 
Before the analysis, we describe our database. 

In Japan, companies having annual income more than 40 million yen are announced publicly 
as "high income companies" every year. For example, we can obtain the lists of all the companies 
in the database published by DIAMOND INC. which is supphcd from TEIKOKU DATABANK, 
LTD. (TDB). The information for each company in the database is, however, limited, only 
company's name, its job category and the income. In this paper, we analyze the database, 
"CD Eyes" published by TOKYO SHOKO RESEARCH, LTD. (TSR) [11] instead of the TDB 
database. The TSR database includes rich financial information, not only company's name, its 
job category and the income, but also capital, number of employees, sales and profits. We should 
note that these detailed pieces of information are not added to all the high income companies 
announced publicly. The number of the high income companies to which the detailed information 
is added in this database is smaller than the number of the high income companies annoTinced 
publicly. However, we consider that there is a sufficient amount of data in this database to 
investigate Pareto law and Pareto index. 

For example, we compare two income distributions of Japan in the fiscal year 2002 (Fig. 1). 
One is obtained from the TDB data which includes all the high income companies announced 
publicly, the other is obtained from the TSR data which does not include all the companies. The 
horizontal axis indicates the income x in units of thousand yen and the vertical axis indicates 
the number of companies with the income larger than or equals to x {Np{> x) in Eq. (1)). 
There is almost no difference in two distributions except for a parallel gap along the vertical 
axis. Moreover, if we estimate Pareto index up to 2 figures of significant figures, the difference 
between both Pareto indices can be disregarded. Here, we estimate the Pareto index for each 
data by excluding top 0.1% and bottom 10% and using least-square-fit. 

We also show several income distributions in typical job categories for the TRS data in Fig. 2. 
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Similar distributions arc also obtained from the TDB data [4, 5]. We estimate the Pareto index 
of the distribution in each job category by excluding top 1% and bottom 10%. Therefore, we 
consider that the TSR data is useful to investigate Pareto law and Pareto index. 

We estimate Pareto index of power law distribution in every job category using the TSR data 
in the fiscal year 2002. There are exceptions, for example, the distribution for power companies 
in Fig. 2 should be considered to deviate from power law. Such deviations from power law can 
be measured by coefficient of determination in least-square-fit. In the TSR database, we can 
know the capital of each company, therefore, we also calculate the average capital of companies 
in each job category (C). We show the relation between the average capital and the Pareto 
index (Fig. 3). The horizontal axis indicates the average capital in units of thousand yen and 
the vertical axis indicates the Pareto index (a). Here, we exclude several job categories in which 
income distribution does not follow power law. We decide the exclusion by the criterion that 
the coefficient of determination is less than 0.97. In Fig. 3, we observe that the larger the 
average capital of each job category becomes, the smaller the Pareto index becomes, and we 
quantitatively find 

a- -0.311ogC'-h2.7 . (2) 

Is this relation peculiar to the job classification? In order to examine this question, in the 
next Section, we classify companies into capital categories regardless of the job category. 

3 Income distribution in capital categories 

In this Section, we classify Japanese high income companies into capital categories such as 
0—1, 1 — 10, 10 — 100 and 100— in units of 100 million yen. In this classification, we find that 
even high income distribution has not only the power law region but also the region which does 
not follow the power law (Fig. 4). In Fig. 4, we can observe that the larger capital class becomes, 
the smaller the Pareto index in power law region becomes. This qualitative relation is similar to 
Eq. (2) obtained in the job classification. Consequently, we consider that the relation between 
the average capital and the Pareto index (2) is directly nothing to the job classification. We also 
observe that the larger capital class becomes, the larger the typical scale becomes in Fig. 4. 

In Fig. 4, however, it is difficult to read Pareto index and typical scale quantitatively, because 
it is hard to determine the power law region and the typical scale. In order to avoid this difficulty, 
in the next Section, we classify companies into cumulative capital categories. 

4 Income distribution in cumulative capital categories 

Prom the distributions in Fig. 4, we suppose that companies in the large capital class leads 
Pareto-Zips law in the large-scale region. In order to confirm this, we classify companies into 
cumulative capital categories such as 0—, 0.32—, 1—, 3.2—, 10— and 32— in units of 100 million 
yen. In this classification, we find that each Pareto index in power law region can be estimated 
to be about 1.0 (Fig. 5). 
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When Parcto index is fixed in each class, the typical scale can be read systematically. We 
read the scale by using the distribution functions [13, 9] derived in 2-dim gravity model [12] 
(See Appendix in detail). As a result, we observe that the larger the lower bound of capital 
in each cumulative capital category {Cmin) becomes, the larger the typical scale (w) becomes 
(Fig. 6). In Fig. 6, we find a quantitative relation: 

w ~ 4.1 . (3) 

In this relation, when there is no bound with respect to capital {Cmin = 0); there is no 
typical scale {w = 0) and the income distribution follows Pareto law in all scale region. Eq. (3) 
is, therefore, consistent with the observation that almost all distributions follow Pareto law in 
the job classification in Section 2. 

5 Discussion 

In this paper, by analyzing the database of high income companies in the fiscal year 2002 
Japan, wc obtained the following two results. Average capital is related to the Pareto index 
(Eq. (2)). Typical scale at which Pareto law breaks is related to the lower bound of capital 
(Eq. (3)). 

Firstly, from Eq. (2), the small change of Pareto index can be interpreted as small fluctuations 
in the average capital. If we assume that similar equation to Eq. (2) holds in other years, we 
may explain that the Pareto index of company income distribution hardly changes. In addition, 
if we assume that capital and assets have a relation and similar equation to Eq. (2) holds in 
personal income distribution, we may also explain that the Pareto index of personal income 
distribTition changes sharply by large fluctuations in average means. 

Secondly, from Eq. (3), the reason there is a (no) typical scale at which Pareto law breaks 
down in the income distribution can be understood through (no) constraint, such as the lower 
bound of capital or means, in the financial system [9]. 

There is an unsolved problem: why the exponent of the power law distribution of all high 
income companies is 1. It is reported that the Pareto index in Italy, Spain, France and UK also 
hardly changes around 1 [6] . It is natural to consider that there is some mechanism which fixes 
Pareto index at 1 universally. The relation between the average capital and the Pareto index (2) 
may be used to study this problem. We should, however, determine the normalization of Eq. (2) 
to study this problem. Although Pareto index a in the left side of Eq. (2) is dimensionless, 
the average capital C in the right side has monetary dimension. Since logarithm of the average 
capital is taken, Eq. (2) is sufficient to estimate change of the Pareto index with respect to the 
average capital. When we estimate magnitude of the Pareto index, however, we must divide the 
average capital by some criterion with monetary dimension and determine the normalization. It 
is difficult to fix this normalization only by analyzing a single fiscal year. 

We should examine the relation between the average capital and the Pareto index over several 
years to complete Eq. (2) up to the normalization. This remains to study. 
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Appendix A 2-dim gravity model 

In Section 4, we used the distribution functions [13, 9]: 



nw{x) ~ N\Y x'^° ^ exp 



for <x <w , (A.4) 



w 

np{x) ~ Np for u; < x < oo (A.5) 



derived in 2-dim E? gravity model [12] in order to read the typical scale at which power law 
breaks down. Here 70 and 700 are constants determined by the parameter c, 

c- 12 

70 = 2 + — g— , (A.6) 
7» = ,^ ^-25-V(25-.)(l- ^, (^.,) 

In the power law region of the company income distribution, we observed that the exponent 
7oo — 2 of power law (A.5) is —2, then we decided that c = 1 and 70 — 2 = —11/6. We should 
note that the exponent of power law is —1 when the number of companies is accumulated, 

Np{'> x) = dx np{x) oc . The typical scale is w, which connects two distribution 
functions (A.5) and (A.4). Wc read the scale w by fitting the distribution function (A.4) to the 
region where the distribution does not follow power law [9]. 

The first reason wc use 2-dim gravity model, it can derive both power law distribution 
and distribution which does not follow the power law in a unified framework. The typical scale 
at which power law breaks, therefore, is clearly understood in this model. We introduce a scale 
w into the model, which is originally scale free Sfree, by adding an interaction term with a scale 
Sscale [12]: 

Sfree{X';g) = ^ j d'^x^g'"' d^X'd.Xi , (A.9) 

Sscaieig) = -^J ^^^VgR^ ■ (A.io) 

Here, Stotal is the total action, X*(z = 1,2, •••,c) are conformal scalar matter fields coupled 
with gravity, gni,{^JL,v = 0,1) is the metric of 2-dim surface, R is the scalar curvature and w 
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is a coupling constant of length dimension 2. The partition function for fixed area A of 2-dim 
surface is 

^(^^ = / ^^f''""^''^'''^ I ^'^^ - ^) • ^^-^^^ 

By using this partition function, the statistical average number of finding a surface of area B 
on a closed surface of area A can be expressed as Eqs. (A. 4) and (A. 5) [12, 14, 13, 9]. Here we 
used expression (1 — B/A)B = x. 

The distribution functions (A. 4) and (A. 5) are geometrically understood as follows. The 
scale free action S free leads the locality that local structure of 2-dim surface does not affect the 
2-dim structure away from that place. Prom this property, it is well known that a typical 2-dim 
surface has fractal self-similar structure (Fig. 7) [15, 16]. On the other hand, the scale variant 
action Sgcale the effect to let 2-dim surface flat. When this effect cannot be ignored, local 
structure of 2-dim surface does affect the 2-dim structure away from that place. The locality of 
2-dim surface breaks down and some kind of constraint is imposed on 2-dim surface. Therefore, 
the statistical average number of finding a small surface of area B on a closed surface of area A 
decreases. As a result, the fractal structure in the small area region breaks down and the typical 
scale is introduced into the system [12, 13]. 

This 2-dim geometric phenomenon has qualitative common features to the phenomenon in 
the company income distribution. When there is lower bound of capital, the company income 
whose capital is below the lower bound is hardly counted in the company income distribution 
(Fig. 8). As a result, the small income distribution does not follow Pareto law and the scale at 
which fractal power law breaks is introduced into the income distribution. From this qualitative 
similarity between two kinds of distributions, wc used the distribution functions derived in 2-dim 

gravity model as measuring tool to read the typical scale in the income distribution [9]. 
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Figure 1: Company income distributions in the fiscal year 2002 Japan. 
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Figure 2: Company income distributions in typical job categories. 
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Figure 3: Relation between the average capital of companies and the Pareto index. 
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Figure 4: Company income distributions in capital categories. 
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Figure 5: Company income distributions in cumulative capital categories. 
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Figure 6: Relation between the lower bound of capital and the typical scale. 
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Figure 7: A typical 2-dim surface is fractal. 
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Figure 8: Relation between the capital and the income of high income companies. 
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